The Identification of Index Terms in Natural Language Object Descriptions
Abstract
"The flowering part, it looks like someone is sticking their tongue out" (a subject's description of Arethusa bulbosa; see Figure 1). The mechanisms that people use in natural settings to describe objects to one another can inform the design of image retrieval and museum systems. The image retrieval problem may be recast as an object description problem where the images are of objects. This study examines the vocabulary and communication constructs that novices and domain experts use to describe objects in an object identification task. These human-centered devices may prove more understandable and easier to use than some purely computational approaches. The experimental conditions mimic a scenario in which a person queries an agent (an active botanical information resource) in natural language in order to identify plant images. The analysis identified the objects of discourse (objects, parts and relations), including analogies, exemplars, prototypical shapes and shape modification predicates such as "longer" and "wider." In spoken language, novices and horticulturists use descriptive mechanisms similar to those in botanical text, but at different frequencies. For example, participants rely heavily on visual analogies to objects both within and outside of the domain: "This looks like a X," where X is a plant (e.g., "daisy") or a non-plant (e.g., "butterfly" or "child's drawing of the sun"). The results suggest that indexing and retrieval systems should provide semantic-level similarity mechanisms to allow for whole-object as well as part-wise visual analogy. The systems should also provide a visual vocabulary: a set of images that represent prototypes of the verbal terms collected in this study.
Figure 1: Arethusa bulbosa (Dragon's-mouth). Image labels: Listener's Set [1], Speaker's Set [2].
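As a rough illustration of the kinds of index terms the abstract mentions (visual analogies of the form "looks like ..." and shape modification predicates such as "longer" and "wider"), the following minimal sketch pulls both out of a free-text description. It is not the study's method: the function name extract_index_terms, the regular expression, and the short modifier list are all assumptions made for this example.

```python
import re

# Hypothetical pattern: "look(s) like <phrase>" marks a visual analogy.
# The phrase runs up to the next punctuation mark or the end of the text.
ANALOGY = re.compile(
    r"\blooks? like (?:an? |the )?([a-z' ]+?)(?=[.,;!?]|$)",
    re.IGNORECASE,
)

# Hypothetical, hand-picked shape modification predicates; the abstract
# names only "longer" and "wider".
SHAPE_MODIFIERS = {"longer", "wider", "shorter", "narrower"}


def extract_index_terms(description):
    """Return candidate analogy phrases and shape modifiers found in the text."""
    analogies = [m.group(1).strip() for m in ANALOGY.finditer(description)]
    words = re.findall(r"[a-z]+", description.lower())
    modifiers = [w for w in words if w in SHAPE_MODIFIERS]
    return {"analogies": analogies, "modifiers": modifiers}


terms = extract_index_terms(
    "This looks like a daisy, but the petals are longer and wider."
)
print(terms)
```

A pattern-based approach like this only catches explicitly marked analogies; the semantic-level similarity the abstract calls for would need a richer representation of the compared objects.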
Similar resources
Methodology of Description in Shaykh al-Ishraq
As an ontologist philosopher, Shaykh al-Ishraq believes in a hierarchical being, on the basis of which he presents his classification of various descriptions. These descriptions vary both longitudinally and latitudinally. That is, for instance, though his intuitive descriptions are at the latitude of his logical analytic descriptions, possesses itself a longitudinal order successiv...
Framework for using a Natural Language Approach to Object Identification
Object-oriented analysis and design has now become a major approach in the design of software systems. This paper presents a method to automate natural language requirements analysis for object identification and generation based on the Parsed Use Case Descriptions (PUCDs) for capturing the output of the parsing stage. We employ Use-Case Descriptions (UCDs) as input into the whole framework of i...
Resolving Ambiguous Descriptions through Visual Information
In the context of the SFB (special research group) 360 "Situated Artificial Communicators," the project "Reference in Discourse" deals with the selection of a specific object from a visual scene in a natural language situation. In the SFB scenario, a robot is instructed by a person to construct a toy airplane. One of the prerequisites to solve this task is the identification of a described object...
Using Prolog for Biological Descriptions
We describe a system which performs biological identification on the basis of natural language descriptions. The system parses texts containing large sets of biological descriptions in restricted natural language and constructs a knowledge base. The system can semi-automatically adapt to a text by extending its lexicon and, to a limited extent, its grammar. Prolog features are important in both...
Learning Models for Object Recognition from Natural Language Descriptions
We investigate the task of learning models for visual object recognition from natural language descriptions alone. The approach contributes to the recognition of fine-grain object categories, such as animal and plant species, where it may be difficult to collect many images for training, but where textual descriptions of visual attributes are readily available. As an example we tackle recogniti...